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Streptococcus agalactiae strain 138P was isolated from the kidney of diseased Nile tilapia in Idaho during a 2007 streptococcal 
disease outbreak. The full genome sequence of S. agalactiae 138P is 1,838,701 bp. The availability of this genome will allow com- 
parative genomics analysis to identify genes for antigen discovery and vaccine development. 
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Streptococcus agalactiae (group B streptococcus) is a Gram- 
positive pathogen causing human meningitis (1-3), neonatal 
sepsis (4), and pneumonia (5). In cattle, S. agalactiae is a major 
cause of bovine mastitis, a dominant health disorder affecting 
milk production (6, 7). In fish, S. agalactiae causes meningoen- 
cephalitis (8). In 2001, S. agalactiae was responsible for a massive 
fish kill at the Kuwait Bay, causing a loss of 2,500 metric tons of 
wild mullet {Liza klunzingeri) (9). Large-scale streptococcal out- 
breaks occurred frequently in tilapia farms of China from 2009 to 
201 1, involving >95% of farms, with a cumulative mortality of 30 
to 80% (10-12). Reports from China indicate that the prevalent 
strains of Streptococcus causing disease in tilapia have shifted from 
Streptococcus iniae to S. agalactiae (10-12). In 2007, a streptococ- 
cal disease outbreak occurred in cultured Nile tilapia in Idaho. The 
strain 138P is a representative isolate collected from that disease 
outbreak. 

The genome of S. agalactiae 138P was sequenced using the 
lUumina 1500 HiSeq platform. BioNumerics (Applied Maths) was 
used to de novo assemble a total of 30,870,348 sequence reads with 
an average length of 100 bp. The genomic contigs were used to 
search for sequence homology with deposited genome sequences 
in the GenBank nucleotide database using BLASTn. The first 
6,000 bp of the genome sequence of S. agalactiae 138P shared 
100% identities with the S. agalactiae strain 2-22 genome (acces- 
sion no. F0393392). Therefore, strain 2-22 was used as a reference 
genome for this study. The whole genome of S. agalactiae 138P is 
1,838,701 bp with a G+C content of 35.5%. The PAST server (13) 
predicted 1,891 coding sequences, including 305 involved in car- 
bohydrate catabolism, 160 in protein metabolism, 141 in synthesis 
of amino acids and derivatives, 135 in cell wall and capsule syn- 
thesis, 103 in RNA metabolism, 101 in DNA metabolism, 95 in 
cofactors, 89 in nucleoside and nucleotide synthesis, 72 in fatty 
acid synthesis, 65 in membrane transport, 5 1 in virulence, 5 1 in 
stress response, 35 in phosphorus metabolism, 30 in cell division 
and cell cycle, 25 in regulation and cell signaling, 16 in iron acqui- 
sition and metabolism, 15 in potassium metabolism, and 3 in 
motility and chemotaxis. RAST also predicted 5 copies of 5S 
rRNA, 6 copies of 16S rRNA, and 6 copies of 23S rRNA. 



Based on comparison with the genome of S. agalactiae 2-22, 
S. agalactiae 138P had the following three added features: (i) a 
144-bp sequence between 524,842 nucleotides (nt) and 524,985 nt 
coding for a hypothetical protein, (ii) a 198-bp sequence between 
1,006,860 nt and 1,007,057 nt coding for a transcriptional regula- 
tor belonging to the Cro/Cl family, and (iii) a 393-bp sequence 
between 1,009,336 nt and 1,008,944 nt coding for a hypothetical 
protein. 

Nucelotide sequence accession number. The complete ge- 
nome sequence of S. agalactiae 138P was deposited in GenBank 
under the accession no. CP007482. 
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